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Abstract — Wireless local area networks (WLANs) based on 
IEEE 802.11 standards are becoming ubiquitous today and typi- 
cally support multiple data rates. In such multi-rate WLANs, dis- 
tributed medium access and rate adaptation are two key elements 
to achieve efficient radio resource utilization, especially in non- 
'cooperative environments. In this paper, we present an analytical 
jstudy on the non-cooperative multi-rate WLANs composed of 
selfish users jointly adjusting their data rate and contention 
"window size at the medium access level to maximize their own 
^throughput, irrespective of the impact of their selfish behaviors on 
.overall system performance. Specifically, we develop an adapted 
"Tit-For-Tat (TFT) strategy to guide the system to an efficient 
■equilibrium in non-cooperative environments. We model the inter- 
actions among selfish users under the adapted TFT framework as 
3 non-cooperative joint medium access and rate adaptation game. 
I'A systematic analysis is conducted on the structural properties 
"of the game to provide insights on the interaction between rate 
'adaptation and 802.11 medium access control in a competitive 
^setting. We show that the game has multiple equilibria, which, 
•after the equilibrium refinement process that we develop, reduce 
•to a unique efficient equilibrium. We further develop a distributed 
algorithm to achieve this equilibrium and demonstrate that the 
.equilibrium achieves the performance very close to the system 
■optimum in a social perspective. 

] I. Introduction 

'A. General Context 

' Wireless local area networks (WLANs) based on IEEE 
,802.11 standards are becoming ubiquitous today and typically 
■support multiple data transmission rates, e.g., four in 802.11b, 
.eight in 802.11a/g and 20 in 802. lln. In such multi-rate 
■WLANs, distributed medium access and rate adaptation are 
.two key elements to achieve efficient radio resource utilization. 
'In this paper, we focus on the competitive scenario where 
.selfish users jointly adjust their data rate and contention window 
(CW) size at the medium access level to maximize their 
own throughput, irrespective of the impact of their selfish 
behaviors on overall system performance. Our focus on the 
non-cooperative scenario is motivated by the following two 
observations: 

Today, network adapters are highly programmable, making 
a selfish user extremely easy to tamper wireless interface to 
maximize its own benefit, especially in environments without 
central administration such as public hot-spots or WLANs 
operated by different enterprisers; 

WLANs are by nature distributed environments lack of 
coordination and sophisticated network feedback. In such envi- 



ronments, non-cooperative selfish behaviors maximizing local 
utilities are much more robust and scalable than any centralized 
cooperative control, which is very expensive or even impossible 
to implement. 

B. Related Work 

The existing literature on the non-cooperative medium access 
and rate control in WLANs can be naturally categorized in the 
following two domains: 

Non-cooperative medium access: Recent studies Ul, ||2| have 
studied some undesirable effects of the selfish behaviors at the 
medium access level on the system performance, with the main 
result being that such selfish behaviors, even of a small number 
of users, can paralyze the entire network. Hence a number of 
mechanisms, based on punishment schemes [I], ||2l, disutility 
function |[3l and Tit-For-Tat concept IH, have been proposed 
to thwart the selfish behaviors and drive the network to an 
efficient equilibrium. Another relevant research thrust Q, ||6l, 
I?], consists of applying game theory to analyze and reverse- 
engineer the 802.11 MAC protocol and providing insights on 
designing more efficient wireless MAC protocols. 

Non-cooperative rate adaptation: Since 802.11 standards do 
not specify any rate adaptation algorithm, extensive research 
efforts have been investigated in this field, resulting a number of 
rate adaptation mechanisms, ranging from the seminal work on 
automatic rate fallback (ARE) approach [8] to later developed 
algorithms to improve and replace ARE, e.g., ||9l, ifTOl . ifTTl . 
The key idea is to track channel quality based on packet 
losses and adapt data rate accordingly. More recently in non- 
cooperative setting, related works ifTSl . lfT3l . lfT4ll . ifTSl have 
discussed some undesirable effects of the 802.11 MAC layer 
distributed coordination function (DCE) on the overall network 
performance when multiple competing nodes use different data 
rates. Specifically, llT2l demonstrates the so-called performance 
anomaly in 802.11 WLANs such that when competing nodes 
transmit at different data rates, the aggregate throughput is 
dominated by the lowest transmission rate. Tan et al. (131 
investigate the time-based fairness, in which each node is given 
an equal amount of channel time, and the throughput-based 
fairness, in which each node achieves equal throughputs and 
show via experiments that time-based fairness can improve 
performance in multi-rate WLANs. Rate adaptation games 
in WLANs without and with loss distinction are studied in 
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lfT6ll . Pricing mechanisms are proposed in ifTTll to increase the 
efficiency of the equilibrium of the non-cooperative rate control 
game. 

C. Paper Overview and Contributions 

Despite a rich body of existing work, the vast majority of 
them addresses the non-cooperative behaviors at medium access 
and rate adaptation levels separately. The following natural 
but crucial questions arise: What is the situation if users can 
jointly configure their medium access and rate level strategies 
selfishly? How the two levels interact with each other? How 
to orient the system towards a fair and efficient equilibrium in 
this two-dimensional competitive scenario? 

Motivated by the above observation, this paper provides 
a systematic analysis on the non-cooperative joint medium 
access and rate adaptation game in 802.11 WLANs under the 
adapted Tit-For-Tat (TFT) framework, a natural strategy in 
non-cooperative environments widely applied in many appli- 
cations such as peer-to-peer networks ifTSl . We adapt the TFT 
strategy at both medium access and rate adaptation level (the 
authors of 01 study the TFT strategy only at medium access 
level for homogeneous users where the rate adaptation is not 
considered). The adapted TFT strategy does not require any 
coordination or incentive mechanisms which may be expensive 
or even impossible to implement in distributed environments 
as 802.11 WLANs and inherently ensures user fairness at both 
levels. 

Aiming to provide in-depth understanding of the interaction 
between rate adaptation and medium access in competitive 
setting from an analytical perspective, we analyze the structural 
properties of the formulated game. Specifically, we show that 
the game has multiple equilibria, which, after the equilibrium 
refinement process that we develop, reduce to a unique efficient 
equilibrium. We further develop a distributed algorithm to 
achieve this equilibrium and demonstrate that the equilibrium 
achieves the performance very close to the system optimum in 
a social perspective. 

From a user-centric perspective, our work also provides 
an analytical framework that can stabilize the network in a 
distributed fashion around a system equilibrium with fairness 
(or service differentiation) and high efficiency. 

II. System Model 

We consider a multi-rate 802.11 WLAN of a set TV = 
{1, • • • , n} of selfish (but not malicious) users, each of whom 
tries to maximize his own utility (e.g., throughput) by conduct- 
ing the following two-dimensional selfish strategies: 

Medium access: Each user i selfishly modifies the distributed 
random back-off medium access mechanism imposed by 802. 1 1 
DCF so as to maximize its utility. More specifically, we focus 
on the modification of the key parameter in DCF, the contention 
window (CW) size, denoted by Wi. 

Rate adaptation: Each user i selfishly configures its data rate 
Ri, given the channel condition and other parameters, so as to 
maximize its utility. Note that IEEE 802.11 standards support 
multiple transmission rates and leave users to implement their 



rate adaptation algorithms, which essentially consists of seeking 
a tradeoff between the transmission rate and the packet loss due 
to channel errors. 

III. Markov Model of a Multi-rate 802. 1 1 WLAN 
WITH Selfish Users 

A. The Markov Model 

To model the users' selfish behaviors and to capture their 
impact on the network performance, we develop a Markov 
model on the exponential back-off process by taking into 
account the users' selfish behaviors. Our model follows the 
ideas of the seminal work of Bianchi lfT9l . and is based on the 
same assumptions: 

• Traffic saturation: The network is saturated such that all 
users have packets to transmit at any tim^H; 

• Decoupling of collision probability: The collision proba- 
bility is decoupled from the back-off stage; 




q,/(2"'W,) 



Fig. 1: The Markov model of user i 

The Markov model of user i selfishly operating on Wi and Ri 
is depicted in Figure [T] Each state in the chain is denoted by a 
couple (/, k) with / = 0, • • • , m being the back-off stage and k 
the back-off counter (fc < 2' — 1). The CW value doubles after 
each consecutive colhsion until when the maximum back-off 
stage m is achieved. State transitions happen at the beginning 
of each virtual slot, as defined in fi9\. Despite the similarity 
between our model and the Bianchi's model on standard 802. 1 1 
DCF, there are two key differences that make the sequential 
analysis more challenging: 

Heterogeneity: The selfishness at both medium access and 
rate adaptation level leads to heterogeneous Markov models 
among users. Hence the resulting stationary state solution is 
heterogeneous among users, rather than being identical for all 
users as in lfT9l : 

Impact of data rate: Different from |19 |, the stationary state 
of the Markov model is jointly determined by the medium 
access parameters and data rate. As shown later in the paper, 
this cross-layer interdependence brings non-trivial difficulties in 

' This assumption is justified in our context as the network is saturated when 
selfish users try to maximize their share of bandwidth by depriving other users 
of their share. 
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the analysis of the system equiHbrium and requires an original 
study that cannot rely on any existing results. 

Denote the transmission probability of node i in a random 
slot and pi the conditioned collision probability of i (the 
collision probability when i transmits a packet in a random 
slot), recall the decoupling and saturation assumptions, we can 
establish the state transition probabilities as follows: 
P{l,k\l,k + 1} ^1, 0<fc<2'Wi-2, 0<l<m, 
1 - 9i 



P{0,k\l,0} 
P{l,k\l- 1,0} 
P{m, k\m, 0} = 



ft 



2"W, 



< fc < - 1, < / < m, 
, < fc < 2'Wi - 1, l<l<m, 
, < fc < 2"'W, - 1, 



where -P{si, 6i|s2, 62} denotes the transition probability from 
state (si,6i) to (52,^2), ft denotes the probability a packet 
arrives at destination without error, detailed as follows 

ft = l-(l-pO[l-e,(i?,)], (1) 
where ei{Ri) is the packet error rate such that a packet is 
lost due to channel error when user i operates on data rate 
Ri. The four formulas describe the transition probabilities 
corresponding to the following scenarios under DCF: 

• i decrements its CW at the beginning of each slot; 

• Once i finish a successful transmission, the next packet's 
back-off timer is selected uniformly from the range 
[0, Wi — 1], corresponding to the first back-off stage; 

• In case of a transmission failure, the back-off stage is 
incremented and the back-off timer is selected uniformly 
from the range [0, 2'Wi - 1]; 

• Once in stage m, the back-off stage is no more incre- 
mented. 

B. Analysis of Stationary State Solution 

To derive the stationary state of i, let bi^k — 
limt^oo P{s{t) = I, b{t) ~ k} denote the stationary proba- 
bility distribution, we can derive the following equations to 
express &;,fc by 60,0: 

hfi = ft&o.o, < I < m - 1, 



bm,0 — 



ft 



■^0,0, 



2/m-_k 



bLo, 1 < fc < 2Wj ~ 1, < I <m. 



Apply YdLo Efcio' ^ ^i,fc = 1' we can solve 60,0 as 



bo.o 



2{l ~ 2q,)il ~ q,) 



(1 - 2q,)iW, + 1) + q,W,il - (2g0™) ' 
Thus, the probability ti that user i transmits in a random slot 
can be expressed as 

^ 2 

n = 2^ bi.o ^ Z7— „. ^rr,.-^,^ w - (2) 



On the other hand, we have 

Combine ([TJ, ^ and ([3]l, we obtain 3n equations with 3n 
unknowns. In Theorem [T] that follows, we prove that given 



any data rate profile {Ri)ieJ\f, the system characterized by 
the 3n equations admits a unique solution under the mild 
condition Wi > 3,Vi € Af. These equations can then be solved 
numerically. 

Before presenting Theorem [T] we define the following func- 
tion; 

2 



1 + xW,J:Z^\2x) 



(4) 



The following properties hold straightforwardly: 
> Ti = Ti{qi)- 

. From ([B, @ and it holds that 

(i-,.)(i-r.(ft))_ 



(5) 



l-e,(i?,) 

> By checking the derivative, it can be shown that (1 — a;) [1 — 
ri(x)] is monotonously decreasing in x if Wi > 3. 

Theorem 1. Under the condition Wi > 3, Vi S M, the Markov 
model characterizing a 802.11 WLAN with selfish users admits 
a unique stationary-state solution for any data rate profile 

Proof: We first show that the Markov model has a station- 
ary state solution. By injecting pi into qi, we obtain 

ft = 1 - (1 - e,(i?,o) n 

Recall (|2|i and denote T_i = {Tj,j G Af,j 7^ i}, Ti can be 
regarded as a function of r_i. To prove the existence of a 
stationary state solution in the Markov chain model, it suffices 
to show that the mapping from T_i to r,;, described as follows, 
has a fixed point: 

2 

Ti = Ti{T^i) ^ 



l + q:W,ET=ro\2q,y 



where q, = 1 - (1 - e,{R^)) UJe^f.3M^ ^ '^J')' 

Noticing that < tj < 1 holds for any j E Af and that 
< e,(i?j) < 1, it holds that 

2 2 

r^(T_,) < , < — — r < 1, 



T,(t_,) > 



m — 1 

W, + 1 + e,{R,)W, [2e,{R,)f 
1=0 

>0. 



W, + l 



W, + l 



m—1 

w, J2 2' 



It follows from Brouwer fixed point theorem [20| that there 
exists a fixed point to the mapping {Ti)i^^. 

We then proceed to show the uniqueness of the stationary 
state solution. Assume, by contradiction, that there exists 
two distinct stationary points = {t} ,p\,ql,i G N) and 
s-*- = (rf ,pf , , i e N), their must exists i such that q] ^ qf. 
Without loss of generality, assume that qj < q^. 

Note that (|5]l holds at both solutions and that (1 — a;)[l — 
ri(a;)] is monotonously decreasing in x if Wi > 3, it follows 
from the assumption qj < qf that rijgArll-T/) > DjeA^ll- 
tJ), which, by applying (|5]) to other users j, leads to < q^. 
Noticing that tj = is monotonously decreasing in qj, 

we have tJ > rj for any other user j, which, combined with 
the assumption ql < qf (thus > Tf), clearly contradicts 



4 



with UjeAfi^ - > Ujejvi^ - )■ thus complete the 
proof of the uniqueness of the stationary state solution of the 
Markov model and also the theorem. ■ 



IV. Adapted Tit-For-Tat Strategy 

First introduced by Anatol Rapoport in Robert Axelrod's two 
tournaments, Tit-For-Tat (TFT) strategy II2TI is regarded as one 
of the best strategies in non-cooperative environments and is the 
root of an ever growing amount of other successful strategies. 
The core idea of TFT is to start with cooperation and continue 
to cooperate if the opponents cooperate. The philosophy behind 
is that in selfish environment each rational player is expected 
to take more aggressive actions if and only if any other player 
acts more aggressively. 

In our context, we propose the following adapted version of 
the TFT strategy at two levels. 

At the medium access level, the TFT strategy is adapted to 
ensure that all users operate on the same CW values in order 
to guarantee the medium access fairness among users. More 
specifically, each user i measures the CW values of other users 
during a period of tim^ If it detects another user j operating 
on a smaller CW value (i.e., Wj < Wi), then it reacts by 
setting Wi = min j^j\f Wj, otherwise it keeps operating on the 
previous CW value. 

At the rate adaptation level, the TFT strategy is adapted to 
ensure the fairness among users in terms of channel occupation 
time, i.e., all users occupy the channel for the same amount 
of time when transmitting a packet. More specifically, each 
user i measures the channel occupation time of other users 
during a period of time. If it detects another user j occupying 
a longer period of time for transmission (i.e., Tj > Ti), then 
it reacts by setting Ti = ma.Xj^^ Tj, otherwise it sticks to 
the previous strategy. The rate adaptation level TFT strategy is 
essentially motivated by the well-known performance anormaly 
in 802.11 WLANs where a selfish user tends to transmit at 
lower data rate so as to enjoy better transmission quality while 
it penalizes other users since a lower data rate increases its 
channel occupation time. 

The above adapted TFT strategy has following desirable 
properties which makes it especially suited in WLANs: 

• The decision is made solely on local measurement. 

• It is simple to implement and only the last measurement 
needs to be stored. 

• It is especially adapted in wireless networks in that the 
broadcast nature makes the observation feasible in promis- 
cuous mode. 

• It ensures the fairness among selfish users. 

In practice, taking into account the various factors influenc- 
ing the measurement in wireless environment, a more tolerant 
version of the TFT strategy called Generous TFT (GTFT) can 
be applied by integrating a tolerance marge. 

^How to observe average CW values during a period of time in a saturated 
WLAN is addressed in (S). 



V. Joint medium access and Rate Adaptation Game 
Formulation 

We focus on the competitive scenario that each user self- 
ishly attempts to optimize its performance by jointly selecting 
appropriate strategy (CW value and data rate) under the adapted 
TFT framework depicted in the previously section. This setting 
gives rise to a non-cooperative joint medium access and rate 
adaptation game. In this section, we first specify the utility 
function of each user and then give the formal definition of the 
game. 

A. User Utility: Throughput Analysis 

We study a natural utility function for selfish users, the 
effective throughput, defined as the quantity of bits per unit time 
successfully arriving at the destination. In this subsection, we 
use the stationary state solution of the developed Markov model 
to derive the expected effective throughput of each user i. We 
set out by computing the average virtual slot duration, denoted 
as Tsiot- As defined in [21], a virtual slot may correspond to a 
slot where the channel is idle, to a successful transmission, or to 
a collision. Specifically, let a denote the duration of an empty 
slot, denote the duration of a successful transmission of user 
i, and T^* the duration of an unsuccessful transmission of user i 
due to either collision or channel error. We can mathematically 
develop Tsiot as 



lot 



(6) 

where the three terms represent, respectively, the following 
possible scenarios: 

« with probability Y\ji^j^{^ — Tj), the system experiences an 
empty slot; 

. with probability n WjeM.j^ii^ ^ ~ ei{Ri)), i has a 

successful transmission; 
• with probability TiYlj^f^ j^ii^ — Tj), i experiences a 
collision, with probability nY[j(zj^j^i{i - Tj)ei{Ri), i 
experiences a transmission failure due to channel error. 
In the base-Une 802.11 MAC protocol without RTS/CTsEl 
the TFT framework ensures that the packet duration is identical 
among users. Denote T the time for a packet transmission, de- 
note ACK, DIFS and SIFS the time to transmit the ACK DIFS 
and SIFS frames, respectively. By neglecting the propagation 
delay, we can compute and as follows 

tI = T + SIFS + ACK + DIFS ~ T, 

SIFS ■ 



V, = T 



where the approximation is due to that the control frame size 
is order of magnitude smaller than that of a data frame. 

^^Altliougli we focus in tliis work on tlie base-line model, our method and 
analysis can be applied in the model with RTS/CTS dialogue, which we leave 
for future study. 
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Let Pi ^ 1 — Ti, noting that cr <C T, we have 

Tsiot - (1 - n Po)^- 

Note that (1) the probability that user i performs a trans- 
mission without colHsion is (1 — Pi)Y\j(^j^ Pj\ (2) the 
transmitted packet, whose size is RiT, is corrupted due to 
channel error with probability ei{Ri), after some algebraic 
operations, the effective throughput can be written as 



To conclude this subsection, we take an analytical look at the 
packet error rate function ei{Ri). Intuitively, under the adapted 
TFT framework, each user should strike a balance between 
sending more bits in the packet transmission time by choosing 
a higher data rate but at the price of increasing the transmission 
error and operating at a lower data rate with less transmission 
error Mathematically, such tradeoff at the rate adaptation level 
is modeled as follows. Assuming perfect error detection and 
no error correction, we can express ei{ri) as ei{Ri) = (1 — 
Pe{Rr))^'^, where Pe{Ri) is the bit error rate (BER). is a 
function of Eb/No, the bit-energy-to-noise ratio of the received 



signal, e.g., Pe 



-Et/No fQj. DpsK. Assuming an additive 



white Gaussian noise (AWGN) channel, the bit-energy-to-noise 
of the received signal is derived from the SNR 



No 



ratio of i 

(Signal-to-Noise Ratio) as follows: 

h^P^ Bt 



Ef] ^SNR^ 



<T^ R. 



h,Bt P. 

^2 Q' 



where Bt is the unspread bandwidth of the signal. Pi is the 
transmission power of i, hi is the channel gain from i to the 
receiver and is the AWGN power at the receiver 

Generally, define Gi{Ri) = (1 — ei{Ri))Ri, we have the 
following features on ei{Ri) and Gi{Ri) with practical setting 
on the packet transmission duration T and the minimal and 
maximal operational data rate Rmin.i and Rmax,i- 

Lemma 1. It holds that: 

• ei(0) 0, ei(oo) 1; ei{Ri) is monotonously increas- 
ing, twice derivable and strictly convex in Rmin,i < Ri < 

RmaXjit 

• e'i{Ri) > and is monotonously increasing in Ri; 

• Gi(0) = 0, Gi(oo) — > 0; Gi{Ri) is strictly concave in 

R-muiji — Ri — Rmax,i' 

B. Game Formulation 

We model the interactions among selfish users as a non- 
cooperative joint medium access and rate adaptation game, in 
which each user selfishly maximizes its effective throughput by 
jointly selecting its CW value and data rate under the adapted 
TFT framework. The game is formally defined as followO 

Definition 1. The non-cooperative joint medium access and 
rate adaptation game, denoted as G, is defined as a triple 
G = {M ,{Wi,Tl.i)i^j^/,{Si)i^j^/), where N is the set of 

"^Throughout the paper, we use player and user interchangeably to denote a 
wireless node in the studied WLAN. 



players, (Wi,72.i) is the strategy set of player i, where Wi = 
{0, • • • , oo} and TZi = [Rmin,i, Rmax,i], the effective through- 
put Si is the utility function of player i. 

The solution of the game is characterized by a Nash equi- 
librium ||231 (NE, simply noted as equilibrium in the paper), a 
strategy profile from which no player has incentive to deviate 
unilaterally. It can be noted that the challenge in solving G is to 
solve a two-dimensional and non-decomposable optimization 
problem for each player. To overcome this difficulty and to 
get more insight on the structure of resulting equilibrium, we 
introduce the following two-level hierarchical game model: the 
lower-level rate adaptation game under fixed CW setting and 
the higher-level medium access game. 

Definition 2. Given a fixed CW setting W ^ {Wi)i^^f at 

the medium access level, the non-cooperative rate adaptation 
game, denoted as GrCW), is a tuple {J\f, {TZi)i^j\f, {Si)i^j\f), 
where J\f the player set, TZi = [Rmin,i, Rmax,i\ is the strategy 
set of player i. Si is the utility function of i. Each player i 
selects its data rate Rmin.i l£ Ri l£ Rmax.i to maximize Si. 

Definition 3. The non-cooperative medium access game, de- 
noted as Gm, is a tuple {Af, {Wi)i^^f, (S'i)igA^), where N the 
player set, Wi = {0, • • • , oo} is the strategy set of player 
i. Si is the utility function of i, defined as Si{Wi^W-i) =^ 
5i(W, R*(W)), where R*(W) denotes the equilibrium of 
Gr{W) (i.e., the data rate profile at the equilibrium of the rate 
adaptation game under W). Each player i selects its strategy 
Wi € Wi to maximize its utility function Si. 

By decomposing G, we introduce a two-level hierarchical 
architecture which will help us analyze the two-dimensional 
joint medium access and rate adaptation problem in the non- 
cooperative setting, as explored in the next two sections. 

VI. Equilibrium Analysis of Lower-level Rate 
Adaptation Game 

In this section, we solve the lower-level non-cooperative rate 
adaptation game in which each user adapts its data rate to 
maximize its effective throughput under fixed CW setting. We 
study the existence and uniqueness of equilibrium and explore 
some structural properties of the equilibrium. We focus our 
attention to scenarios where users play pure strategies (do not 
randomized their data rate). We obtain the following results: (1) 
a pure NE exists and is unique; (2) we establish the convergence 
to the unique NE; (3) we investigate the efficiency of the NE 
in terms of Price of Anarchy (PoA) to gain more insights into 
the system behaviors. 

To make the analysis tractable, we make the following 
assumption (or approximation): the number of users is large 
enough so that the strategy change of one user has neglectable 
influence on the system state. The approximation is accurate in 
large systems where the micro-level strategy variation of any 
individual player has neglectable influence on the macro-level 
system state. From a game theoretic perspective, we consider 
a non-atomic gam^ 

^Please refer to |24) for a detailed presentation on non-atomic games. 
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Particularly in this section, the assumption is mathematically 
expressed as the following approximation: 

Up^^ n Pr (7) 

Lemma |2] follows immediately from the non-atomic game 
assumption. The proof, consists of checking the relevant deriva- 
tives and applying Lemma [1] and the equations ([T), (|2|, (O and 

Lemma 2. Under non-atomic game assumption, we have 

* Pi is monotonously decreasing and convex in Ri; 

• ^ is negative and monotonously increasing in Ri. 

Proof: It follows from the non-atomic game assumption 
that Pi = njeA/-j5^i(l - 'Tj) is independent to i?,,. As ei{Ri) is 
monotonously increasing in Ri (Lemma [U, it follows from ([T]i 
that Qi is monotonously increasing in Ri. It then follows 
from (|2| that Ti is monotonously increasing in Ri. Hence 
Pi = 1 — is monotonously decreasing in i?^. ■ 

A. Best-response Function 

At an equilibrium of Gn, if there exists, the data rate of each 
player i consists of the best response given the strategy setting 
of others. Formally, the best-response correspondence of player 
i expressed as a function of the strategies of other players R-i, 
denoted as Bi{R^i), is defined as follows 



Bi{R-i) 



max 

i<Ri<R„ 



Si{Ri, R-i). 



(8) 



We prove in this subsection that in the rate adaptation game 
Gr, Bi{R^i) is uniquely determined by Hence Bi{R^i) 

can be defined as a function of R-t. 

We start by investigating the derivative of the utility function. 
It can be noted that Si {Ri , R-i) is continuous and differentiable 
in Ri. After some straightforward mathematical development. 



we can write 



as, 
an. 



as follows: 



dR, 



il^P^)G,{R,)U,e.^.,^^Pl 

^-UjeAfP] 



G',iR^) 
G,{R,) 



1 ~ IljgA/'.j^iPj dpi 

Let Ai{Ri) denote the term in the parenthesis of 

.^ G[{Ri) ^-Y{,en,mPi dp, 

^ G^{R^) {l-p.){l-U,eAfP,)dR^- 

Noticing that = 1 — and ([1]), we have: 

dpi _ pTi dqi 

dR, ^ ~ 



(9) 



dqi dR 
It then follows that 

G[{Ri) 



— TT 

dq. Ai 



Pje[{R, 



Proof: We can develop the second term of Ai{Ri) as: 
1 ^ TljeM-j^i Pi dpi _ I 1 T{jeN',j=ti Pj 



{l-p,){l-Y{,eMP,)dR 
dp. _ {^^''ATh±. 



\1-Pi I- PtUjeATj^iPj ^ 

p^) aiog(i-p,) \ dp. 



dR,. 



dpi 



9 log 



dpi 



dR, 
dpi 



dpi 



It can be checked that since pj < 1, log 



dR, 
1 

l-p^ 



is monotonously increasing and convex in < pi < 1. Since 
Pi is convex and monotonously decreasing in Ri (Lemma |2]i, 

9 log "jgA-ff' f^ 

dp aR" i^ '■^'^^ monotonously increasing in Ri. 

On the other hand, it follows from Lemma [T] that = 
(log Gi{Ri))' is monotonously decreasing in Ri. Hence Ai{Ri) 
is monotonously decreasing in i?;. ■ 

Based on Lemma |3] we can drive the maximizer of Si by 
distinguishing the following three cases: 

1) A,{R,n,n,t) < 0. It follows from Q that If- < 



for i?„ 



< Ri < Rn 

R„ 



Si{Ri) is monotonously 
with a unique maximizer 



decreasing in [R^ 
being RminX, 

2) A,{R,nax,i) > 0. It follows from Q that ff; > 
for Rmin,i < Ri < Rmax.f Si{Ri) is monotonously 
increasing in [Rmin,i,Rmax.i] with a unique maximizer 
being Rmax,i\ 

3) A,{Rmins) > and Ai{Rmax.t) < 0. There exists R* 



such that If- > for R. 

for R* < Ri^< Rr, 



mm,« < i?^ < i?* and < 
R* is thus the unique maximizer 



of 



The above analysis shows that the best-response in Gr is 
indeed a function that can be expressed as follows: 



B,iR-,) 



Rjnin^i if -^iiRmin.i) — 0; 
1-1/ 



.) < 0, (12) 



1 - I\Je^^,j^i Pj) Ilje^f.J^^ Pj dr, „ , 

--^ei(Ri) 



^^^■'^ G,(i?0 ■ il-P:)il-U,eJ^P,) 
Lemma 3. Ai(Ri) is monotonously decreasing in R 



'Generally speaking, the best response strategy of a game is not necessarily 
a function (one-to-one mapping), i.e., there may exist several global maxima 
of the utility function. 



A-'{0) if A,{R„nn..)>0,A,{R„ 
Rmax,i if -^iiRmax.i) ^ 0- 

Defined B(R) = {Bi{R-i))i^_\f, we show in Lemma|4]that 
B(R) is non-decreasing in RjjVj G Af,j 7^ i. 

(10) Lemma 4. B(R) is non-decreasing in Rj , Vj G M, j 7^ i. 

Proof: Recall the best-response function ( fT2] ). let R* 
denote the root of the equation Ai{Ri) = where Ai{Ri) is de- 
fined in ( fTOb . it suffices to show that if Rmin.i < R* < Rmax,i, 

then i?* is increasing in Rj,yj G J\f,j 7^ i. We proceed our 
proof by contradiction. Assume, by contradiction, that Rf and 
R'l satisfying Ai{R,) = such that Rf < R\ and i?° > R), 



We first show that under the assumption, it holds that < 
p\. We proceed by distinguishing the following two cases: 
> Case 1: HjeArP^ — IljeTV^ Pj- follows from eq 18 and 
the assumption i?f < R\ that qf < q\. It then follows 
from Q that rf > . Hence p°; < p\. 
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• Case 2: IIjgA^ P'j ^ IljeA^ P^j- follows from eq 18 and 
the assumption i?" > i?j,Vj & N,j ^ i that > q'-. It 
then follows from Q that Tj < rj". Hence pj > p]-,^] G 
N,] ^ I. It then follows n^eAA P" < UjeM Pj that pf < 

pI 

We next show that under the assumption, it holds that 

n,eA/-,,#.P? > n,6^jy.P?- Otherwise, if U,e^,,^.P'^ > 
YijeAf j^i p'j^ then recall the assumption on the non-atomic 
game (equation dTJi), it follows from (|5]l that 

(l-g°)(l-r,(g^")) _ ^ (l-g^)(l-r,(g^ )) 



Now recall that and R\ is the maximizer of Si is the range 
[^min.i, -Rmaa;,i], noticing the relationship between J^, we 
have 

> i?,^ => R1 > R^rmna => A{R1) > 0. 

We thus have Ai{R^) > 0, hence we must have i?^ = Rmax,i, 
which contradicts with the assumption Rf > R\ since i?^ is 
upper-bounded by Rmax,i- Hence we have Rf < R\. Recall 
the assumption that JljeAAPi - Y\jeM P'] Pi > Pi' tt 
follows from (HJ, ©, © and = 1 - n that < p^, which 
contradicts the assumption pf > p\. We have thus shown that 



< 



P, 



-|-|- It then holds that r° > rj'jVj e A/", it then follows from © 

= 11 Py Vj eAf,j ^ I. jjj^j g^. ( ™) < g^. (-^h)^ leading to < Ej. If R"" = R'',yj e 



l<£Af,l^i 



Since ej{Rj) is increasing in Rj and we have shown that (1 — 
— rj(gj)) is decreasing in qj, it holds that < 9j, Vj' G 
A/", j 7^ I, leading to p" > p^yj G A/",.?' 7^ i, which contradicts 
to the assumption that Uje^^,J^^ Pj > UjeM,]^, p]- 

Until now we have shown that pf < p\ and HjeA^ j^i Pj — 
Yljej\f,j^i P^j- Now recall the atomic assumption, we can derive 
from dlB that 

A (T.^ 9'i{Ri) YheNJ^ifldn , 

Ai[Ri) - , , H —ei(Ri). 

gi{Ri) I- Pi oqi 

Noticing the assumption Rf < R\, it holds that Ai{Rf, Rt^) > 

Ai{R'l,R^_^), which contradicts with the fact both Rf and 

R\ satisfy Ai{Ri) = 0. This contradiction shows that the 

best-response mapping Bi{R^i) is non-decreasing in Rj,yj G 

Af,j^i. m 

B. Equilibrium Analysis: Existence, Uniqueness and Conver- 
gence 

Armed with the analysis on Bi{R^i), we now show that Gb. 
has a unique equilibrium. 

Theorem 2. Given any CW setting W, the non-cooperative 
rate adaptation game GrCW) admits at least an equilibrium. 

Proof: Recall Lemma |4] and that Bi{R^i) is bounded 
such that Rmin,i < B^{R^i) < Rmax.i, starting by = 
R7nin,i)i£Af, the bcst rcsponsc mapping must converge in a 
monotonously non-decreasing fashion to a fixed point which is 
an equilibrium of the game. ■ 
Theorem [3] further establishes the uniqueness of NE. 

Theorem 3. Given any CW setting W, the non-cooperative 
rate adaptation game Gji{W) admits a unique equilibrium. 

Proof: We prove the theorem by contradiction. Assume 
that two distinct equilibria a and b exist. Without loss of 
generaUty, assume that Yl^^j^ p'- < UjeAfPr 

We first show, by contradiction, that p" < Pp^j G TV. 
Assume that there exists i ^ Af such that pf > p\. We show 
that Rf < R\. To this end, assume, by contradiction, that 
Rf > R\. We have shown in the proofs of the previous two 
theorems that Ai{Ri) is monotonously decreasing in Ri and pi, 
monotonously increasing in pj ( j G N ^ j 7^ i). It then follows 
from the assumption that 

A,{Rf) < A.,{R\). 



M, then recall Theorem [T] it holds that the two equilibria a 
and b are identical, which contradicts to the assumption that 
they are distinct equilibria. Hence there must exist / G N such 
that Rf < R 



We distinguish the following two cases 

R„iax.i- It holds that > 0. If Rf 

i?™„,,,then AiiRf)< O.lfRf > R^.^.i, then Ai{Rf) 
0; 



Case 1: R\ 



Case 1: Rf < R 

b 



Since R'f > Rf > R„ 



it holds 



that Ai{R^) = 0. If Rf = Rmin,i, then Ai{Rf) < 0. If 
Rf > Rrmn.u then Ai{Rf)=:0. 
In both cases, we have Ai{R'f) > Ai{Rf). 

On the other hand, recall the atomic assumption, we can 
derive from (fTTI) that 

g'liRi) , UjeAfj^iPj dri 



MRi) 



9i{Ri 



dq 



■e[{Ri). 



For each i e J\f,we have shown that Rf < R\ (with Rf < R\), 



< P-, Vi G TV. Hence 



-Pi 



-Pi 



It also 



follows from p^ = 1 — t; and (|2]l that qf < q\. Noticing that 
e'i{Ri) > and is monotonously increasing in Ri (Lemma [TJ 
and that t; is concave and monotonously decreasing in qi 
meaning that > and is monotonously decreasing in qi, it 
holds that 

TheAfj^iPldn 



1 - p, dqi 
Moreover, since gi{Ri 
cave in Ri. Hence 



> 



Pj dn 



1 - Pi dqi 
is concave in Ri, log gi{Ri) is con- 
is decreasing in Ri. 



ai{R. 



dloggijRi 
dRi 



Therefore, we have < Ai{Rf), which contradicts with 

Ai{R\) ^ -^i{Ri)- This contradiction completes our proof on 
the equilibrium uniqueness. ■ 
The following theorem follows naturally, establishing the 
convergence to the unique equilibrium of GnifW) under any 
initial state {Ri{Q))ii^j^ in an asynchronous manner 

Theorem 4. Assume that the players of Gr(W) follows the 
best response dynamics from some initial rate vector R(0) = 
{Ri{0))ii=j\f, i.e., from time to time, an asynchronous update 
step is taken where some user i G Af updates its strategy 
from Ri{n) to Ri{n + 1) = Bi{Ri{n)), and the sequence of 
players doing the update steps can be arbitrary as long as the 
number of steps between consequent updates of every individual 
player is bounded. Then, lim„_>.oo R(n) = R*, where R* is 
the unique NE of Gr ( W). 
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Proof: First, consider an arbitrary sequence of update steps 
commencing from an initial vector R(0) — {Rmin,i,i G A/"), 
and denote the resulting sequence of rate vectors by Rmi„(n). 
Obviously, for any player i, the first time it updates its strategy 
will be a nondecreasing update. In light of Lemma H] it 
follows by induction that all updates must be nondecreasing, 
i.e., Rmi„(n) is a nondecreasing sequence. Since Rmj„(n) < 
Umax is bounded, it must converge to a limit. Due to the 
continuity of the best-response function, this limit must be its 
(unique) fixed point R*. 

In a similar manner, consider a sequence of best-response 
updates from an initial vector of R(0) = {Rmax,i,'i <= A/"). 
By the same analysis. Lemma |4] implies that all the updates in 
the sequence must be non-increasing, and the sequence must 
therefore converge to R*. 

Finally, consider a sequence of best-response updates R(n) 
commencing from an arbitrary initial data rate vector R(0). 
From LemmaH] it follows that R,„i„(n) < R(n) < R,„aa;(n), 
provided that for every n, the update step is performed by the 
same flow in all three sequences. Since, as established above, 
Rmm('T-) and Umaxin) convcrge to R*, it follows that the 
same is true for R(n) as well. ■ 



C. Equilibrium Efficiency Analysis 

Having derived the unique NE of Gi^(W), we proceed to 
evaluate the efficiency of the unique NE compared with the 
global optimal by quantifying the Price of Anarchy (PoA), 
[21], defined as the ratio between the optimal global utility 
and the system utility achieved at the NE. In other words, 
PoA quantifies the efficiency loss due to selfish competition 
compared to cooperation. Due to the complexity of the problem 
and in order to make our analysis tractable, we focus on 
the symmetrical and unconstraint scenario where Gi{Ri) are 
identical for all players and (Rmax, respectively) are 

sufficiently small (large) so that the unique equilibrium satisfies 
= 0. However, as shown in Section IVIIII by simulation, 
we observe the same results for general cases. 

We first show in Theorem |5] that in the symmetrical case, the 
equilibrium is also symmetrical such that it holds that R* = 
R*,p* = p*yi,j eAf. 

Theorem 5. In the symmetrical case, the equilibrium is also 
symmetrical, i.e., R* ~ Rj,P* = Pp^hj S A/". 

Proof: We first prove R* = i?*,Vi,j £ Af. Otherwise 
if there exist i, j such that R* ^ R*. Since the players 
are symmetrical, by swapping R* and R* we obtain another 
equilibrium. This clearly contradicts with Theorem [3] on the 
uniqueness of the equilibrium. 

Noticing that are identical among players, it then follows 
immediately from 18 that r* = T*,Vi, j e N, leading to p* = 
P*,Vz,jeAA. ■ 

Armed with Theorem |5] the equilibrium can thus be solved 
numerically by imposing Ai{Ri) = and combining the 
equations ([T), (|2]i and ([3]). Specifically we study a practical 
setting where users use i\/-ary QAM modulation of which the 



bit error rate is 
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Fig. 2: Price of Anarchy as a function of N 

In Figure |2] we plot the numerical result on the PoA with a 
typical WLAN setting M = 64, a = 1, SNR = 15dB. We also 
report the similar results with other parameter settings. From 
Figure |2] and recall the results on the the performance anormaly 
in 802.1 1 WLANs where a user operating on low data rate can 
drastically degrade the overall network performance, we report 
the following desirable properties of the adapted TFT strategy 
at the rate adaptation level: 

• The PoA is close to 1 in the studied cases, signifying 
that the adapted TFT strategy can actually lead the selfish 
users to a reasonable efficient equilibrium, which is shown 
to be unique, despite the user selfishness and the lack of 
coordination among users. 
« The scalability of the adapted TFT strategy is also demon- 
strated in that the PoA decreases with N and tends to 
around L02. 

We conclude this section with a discussion on the distributed 
implementation of the best-response strategy to converge to the 
unique equilibrium. To compute the best-response strategy, each 
user should locally compute ( fTTI ). To that end, each player 
can estimate the SNR by adding a feedback in the ACK 
packets indicating the power at the receiver and compare it 
with the emission power to further derive ei{Ri) and Gi{Ri). 
Besides, each player can calculate JljeA'' Pi based on local 
observation by using the approach proposed in Il25ll . Q based 
on observing the average number of consecutive idle slots 
between two transmission attempts. Since each user knows its 
own strategy, he can compute Ai{Ri) in (fTTI) locally. Hence, 
the NE can be achieved based on solely local observation. 

VII. Equilibrium Analysis of Higher-level medium 
ACCESS Game and the Global Game 

In this section, we investigate the higher-level medium access 
game under the adapted TFT framework given that once players 
choose their CW strategy, they play the lower-level rate adapta- 
tion game by operating on the resulting unique equilibrium of 
the game. In the following analysis, we first show the existence 
of equilibrium for the medium access game, which is also the 
system equilibrium. Since there may exist multiple equilibria, 
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we introduce the equilibrium refinement process to find the 
most efficient one. We then develop a distributed algorithm to 
converge to the refined equilibrium. We also study the efficiency 
of this equilibrium. 

Before delving into the equilibrium analysis, the following 
lemma studies a fundamental property by showing that given 
any strategy of other players, any player i is always better off 
by decreasing its CW value Wi. 

Lemma 5. Under the non-atomic game assumption, given any 
strategy of other player, any player i gets higher utility by 
decreasing its CW value Wi. 

Proof: Given any W-i and any Wi < VF/, we need 
to show that S^{W^,W-^) > S^{W|,W-^). To this end, let 
{qj,Tj,Rj)j^^f denote the system state at {Wi,W-i) with 
{Rj)j£j\f being the equilibrium of the lower-level rate adap- 
tation game and let {qj,Tj, R'j)j^j^f denote the system state at 
{Wl,W-i) with {R'j)j(zj^ being the equilibrium of the lower- 
level rate adaptation game, it follows from the assumption of 
the non-atomic game that the impact of changing Wi and Ri 
on the strategy and state of other players is negligible. Hence 

Now let i operate on data rate R'^ at {Wi,W-i), it fol- 
lows from ([T]) and (O that qi < q[, further leading to 
Ti > t[. Noticing that rj = Tj,Vj e N,j ^ i, we have 
S^{WiW^i) < Si{W^,R'^. Recall the definition that given 
the CW setting (W/, W-i), Ri is the best response to R-i, 
it holds that Si{W^,W-i) < Si{Wi,R.'^). Therefore, we have 

s,{w,,W-,)>s,{wi,w.,). m 

Lemma |5] whose proof is detailed in xx, justifies the pro- 
posed adapted TFT strategy as it is motivated by that smaller 
CW value leads to higher effective throughput. 

A. System Equilibrium Analysis 

The proposed adapted TFT framework can ensure that all 
users operate on the same CW value, i.e., Wj = Wyj E TV. 
For any player i e Af, it can be noted that: 

• When W = cQ, meaning that any player j transmits its 
packets without waiting, we have Si (W) = C@; 

• When W oo, noticing ^ that n < -y^^ 0, it holds 
that §,(W) ^0. 

Hence there exists < W* < oo such that 5i(Wj*) is 
maximized. 

Definition 4. For any player i € Af, let Wi denote the set 
containing all CW values Wi such that for any W < Wi, it 
holds that Si{W) < Si{Wi). 

Recall the adapted TFT strategy, Definiton|4] indicates: 

• when operating at any Wi G Wi, player i has no incentive 
to decrease its CW value; 

^Throughout this section, to make the analysis concise, we use boldface 
letters W to denote a A'-element vector with each element being W, i.e., 
W ^ (Wj = WJ.^M. 

^Recall that Si, defined in Definition [5] denotes the utility of i for the 
higher-level medium access game. 



• when operating at some ^ Wi, player i can always 
find Wi < WI such that ^.(Wi) > Si{W'.), i.e., i is 
always better off by deviating from VK/ to Wi. 

Example 1. Let us see an example to clarify the above 
definition and notation. Consider a scenario where Si{Qi) = 0, 
^i(l)_= 0.1, 5,(2) = 0.15, Si{^) = 0.14, 5,(4) = 0.16 
and Si(W) < 5^(4), for any W > 5. In this example, 
Wi = {0,1,2,4}. When operating on W ^ 3 and W > 5, 
player i can increase its utility by decreasing W to 2 and 4, 
respectively ( note that under the adapted TFT strategy, other 
players will also decrease their CW values correspondingly). 

^Since 5^(0^= and S,(W) > for > 0, we have € 
Wi and 1 e Wt. Recall that W* is the maximizer of ^^(Wj*), it 
follows from Definition g] that W* € Wi. Let W = flieAT Wi, 
W is not empty by containing at least 0,1. We next show in 
Theorem |6] that any strategy profile W with e W is an 
equilibrium of the higher-level medium access game Gm and 
that any other strategy profile cannot be an equilibrium. 

Theorem 6. Under the adapted TFT framework, any strategy 
profile W with W E W is an equilibrium of Gm- Any other 
strategy profile cannot be an equilibrium of Gm. 

Proof: We first show that any player i has no incentive to 
deviate from W where W E W. 

First, it follows straightforwardly from Lemma |5] that i has 
no incentive to increase Wi from W. We now show that i 
has no incentive to decrease Wi from W, either To this end, 
recall the adapted TFT framework, decreasing Wi from W to 
W' < W leads to other players decrease their CW value to 
W'. The system thus operates on W. Recall Definition |4] 
we have 5i(W') < 5i(W). Player i thus has no incentive to 
decrease Wi from W, either It then follows from the definition 
of equilibrium that W is an equilibrium of Gm- 

We then show that any other strategy profile cannot be an 
equilibrium of Gm- Otherwise, if there exists an equilibrium 
W with W' ^ W, then we must have 

5,(w) < 5,(w'), yw < w', vi e jv. 

It then holds that W' e W, which leads to contradiction with 
W' ^ W, which concludes our proof. ■ 

Example 2. To clarify the analysis of Theorem |6| on the 
equilibria ofG^j. Consider an example of Gm with two players 
with the following utility setting: 

. 5i(0) = 0, Si{l) ^ 0.1, Si{2) = 0.15, Si{3) = 0.14, 

5i(4) = 0.16 and Si(W) < 5,(4), for any W > 5; 
. §2(0) = 0, 52(1) = 0.11, 52(2) = 0.16, 52(3) = 0.165, 
52(4) = 0.163, §2(5) = 0.166 and 52(W) < 5^(5), for 
any W > 6; 

In this example, it can be checked that Wi = {0, 1, 2, 4} and 
W2 = {0,1,2,3,5}; hence W = {0,1,2}. Theorem^shows 
that there are 3 equilibria in this example: 0, 1 and 2. It can 
be easily verified that no player has incentive to deviate from 
these equilibria under the adapted TFT strategy and that any 
other strategy profile is not an equilibrium. 
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The following theorem on the equilibrium of the global joint 
medium access and rate control game holds naturally. 

Theorem 7. Under the adapted TFT framework, any strategy 
profile (W,R*(W)) with W e W and R*(W) being the 
unique equilibrium ofGuCW) is an equilibrium of the global 
joint medium access and rate control game G. Any other 
strategy profile cannot be an equilibrium of G. 

Proof: We first show that any player i has no incentive to 
deviate from W* . Consider that i deviates from {W* ,R*{W)) 
to {Wl,R[) with Wi ^ W*. Recall the non-atomic game 
assumption, by operating on R'^ and decreasing from to 
W*, Ti increases and thus pi decreases while pj remains the 
same for j ^ i. Hence Si increases by switching from (VK/, R'l) 
to {W*,R'^}. Therefore, player i has no incentive to increase 
Wi. Moreover, player i has no incentive to decrease Wi to 
some Wl' < W* since under the adapted TFT framework, this 
will push other players to W^' and the system will converge 
to (W{', R*(W")), where R*(W{') is the unique equilibrium 
of Gj\/(W|'). As shown in Theorem |6l the throughput of i 
decreases. 

Moreover, since R*(W) is the unique equilibrium of 
Gfl(W), any player i has no incentive to unilaterally change 
R*, either. 

We then show that any other strategy profile cannot be a 
system equilibrium. Firstly, under the adapted TFT strategy, 
at any equilibrium, the CW values are identical for all players, 
otherwise it follows from Lemma|5]that the players with higher 
CW values have incentive to decrease their CW values. Second, 
the data rate profile must be the unique equilibrium of Gr 
under the correspondent CW setting. Given the above observa- 
tion, it holds that the CW setting of any system equilibrium 
must be an equilibrium of Gm, otherwise if (W, R*(W)) 
with W > W* is a system equilibrium, then for any player 
i, by deviating from W to W*, the system will be dragged to 
(W*,R*(W*)), where it enjoys a higher throughput. It then 
follows from Theorem|6]that any strategy profile with W > W* 
cannot be a system equilibrium. ■ 

Since 0, 1 e W, Theorem Q shows that there always exists at 
least two equilibria in Gm- Generally speaking, among multiple 
equilibria, some are not desirable from the system's perspective. 
This can be illustrated by reexamining Example |2] where among 
the three equilibria, 2 is the most efficient one while corre- 
sponds to system collapse. To address this challenge, a natural 
method is to remove those less efficient equilibria to achieve 
a desirable outcome. This is achieved by the equilibrium 
refinement, explored by the following subsection to find the 
most favorable equilibrium at the system's perspective and to 
approach the refined equilibrium. 

B. Equilibrium Refinement 

We introduce three criteria in the equilibrium refinement 
process: fairness, Pareto optimality and system efficiency. 

Fairness: an equilibrium is fair if the system resource is allo- 
cated fairly among users. In G, all the equilibria (W, R*(W)) 
achieve fairness among players in the sense that the players 



converge to the same CW value and they occupy the channel for 
the same amont of time at any equilibrium. Such user fairness 
is inherently enforced by the proposed adapted TFT framework. 

Pareto optimality: an equilibrium is Pareto optimal if we 
cannot find another equilibrium where the utility of every 
player is higher. Let W* = max^^^^T^^, it holds that only 
(W*,R*(W*)) is guaranteed to be Pareto optimal because 
from Definition 31 we have 

5i(W,R*(W)) < 5',(W*,R*(W*)), W<W*, VieN. (14) 
furthermore, (W*, R*(W*)) is the only Pareto optimum equi- 
librium among the equilibria if for at least one play i, the strict 
inequality holds in (fl4l i. 

System efficiency: following the same analysis on Pareto 
optimality, we can show that ( W* , R* (W* ) ) achieves the 
maximal system throughput among the equilibria. Moreover, if 
for at least one play i, the strict inequality holds in (fT4l i. then 
the system effective throughput at ( W* , R* (W* ) ) is strictly 
higher than any other equilibrium. 

The equilibrium refinement thus leads to a unique efficient 
NE (W*,R*(W*)). Next we study how to approach the 
refined equilibrium. We start by establishing the following 
lemma that bounds W* and leads to more efficient search. 

Lemma 6. Let G™''^ = maxfj^ Gi{Ri). Given any strategy 
profile Wo, denote x* the root of the equation ^il^'i_^y^ = 
it holds that Si{W) < S^(Wo) for W > ^. 
Proof: Recall the formula of Si, noticing that 

(1 - p^) n pi= n pi i[pi<^ li pi^ 



it holds that 

g,(W) = ■ 



Pi) rijgAfj^t 



p,G.(i?*(W)) 



^-Uje^Pj 
< Gi{R*(W)) < G™^, V > 0. 

On the other hand, it can be checked that T{x) — — 



is monotonously decreasing in x and T(0) = 1, T{oo) = 0. 
T{x) — admits a unique solution x* . 

When alf players operate on W, it holds that pj = 1 — tj > 
■|r,Vj e TV. Therefore, W > ^, it holds that 



5'^(Wo) > Gf^'T 



W 



> Tix*) > SdW), 



which completes our proof. ■ 
Lemma|6]implies that operating on CW larger than cannot 
be a system equilibrium. Consequently, when searching the 
efficient equilibrium W* derived from the refinement process, 
if the CW value of players is currently W, it suffices to search 
until minigTi/- Wi where Wi = with x* being the root 

of ^iZ(i-x)^ ~ ^G^^ • -E^^sed on this result, we develop a 
distributed algorithm (Algorithm 1 ) to cooperatively search and 
converge to W*. 

The core idea of the first loop in the algorithm is to have 
a coordinator to synchronize the CW values of players. The 
coordinator can be any player. Then each player i can construct 
Wi based on Definition H] The construction process terminates 
at miiiigA^ wp""^ since from Lemma|6] it suffices to parse CW 
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values until minigTv" VF/""^, which is dynamically updated in 
the algorithm. In the second loop of the algorithm, by applying 
the adapted TFT strategy, the system converges gradually to 
the efficient equilibrium W*. 

As can be noted from Algorithm 1, it can be implemented 
in a distributed fashion and any player can be the coordinator 
to synchronize the CW values. Moreover, all players have 
incentive to participate the cooperative search of the efficient 
equilibrium. This can be shown as follows: it follows from 
the adapted TFT strategy that the system will operate on the 
same CW value for all players; hence the system will operate 
on an equilibrium, otherwise we can find a player who can 
increase its utility by decreasing its CW value; thus the CW 
value is monotonously decreasing and will reach an equilibrium 
because the smallest CW value is an equilibrium; since W* 
is the most efficient equilibrium among the potentially multiple 
system equilibria, any selfish but rational player has incentive 
to operate on W* by participating the cooperative search. 

A note on the robustness of the algorithm. We observe 
via numerical experiments (detailed in Section [Villi that by 
deviating from W* (e.g., due to variation in measurement or 
a large step size instead of 1) and by varying the number of 
players, the user can still achieve reasonably efficient point, 
with at least 80% of the throughput achieved at the efficient 
equilibrium. This observation demonstrates the robustness of 
the algorithm in dynamic scenarios with frequent arrival and 
departure. Such robust feature can significantly facilitate the 
practical implementation of the algorithm. 

C. Efficiency of System Equilibrium 

In this subsection, we investigate the efficiency of the system 
equilibrium (W*,R*(W*)) derived previously. To make our 
analysis tractable, we focus here on the symmetrical scenario 
where the channel conditions are the same for all players. 
Nevertheless, we observe similar results for generic scenarios, 
as detailed in the simulation results in Section IVIIII 

In the symmetrical scenario, it follows from the analysis 
of the previous two subsections that W* = W* ,yi € J\f. 
This means that the refined equilibrium W* is composed of 
the maximizers of all the individual utility function 5"^. Let 
(W°P*,R°P*) denote the system optimum, since W* is the 
refined equilibrium of Gm, we have 

^i(W*,R*(W*)) > 5i(W°P*,R*(W°P*)). 
Noticing the symmetry of players, we have 

E^eJ^ S^{W°P\-R"P^) _ 5,(W°PSR°Pt) 

Ez6A^^^(W*,R*(W*)) ~ S,(W*,R*(W*)) 
^ S'i(W°P*,R°Pt) 

~ S^,(W°PSR*(W°pt))' 
which is the PoA of Gfi(W°P*). This result readily indicates 
that the system level PoA equals to the PoA of the lower-level 
rate adaptation game at W°p*. Since we have demonstrated 
that the PoA of the lower-level rate adaptation game is very 
close to 1, we hence have that the refined equilibrium of the 
global game is also very close to system optimum from a social 
perspective. 



Algorithm 1 Searching the efficient equilibrium W*: executed 
at each player i 



1: Initialization: Set Wt = 0, W^ = {0}, S*™"^ = 0, 
y\^max _ ^ ^jjj ggj. g j-Q ^ small value 

2: Start: Any player I broadcasts a message Start Search 
to start the searching process 

3: loop 

4: if a message SearchStop received then 
5: Quit the loop by going to line 25 
6: end if 

7: if a message StartSearch or IncreaseW received 
or a message StartSearch or IncreaseW sent in 
case i = I then 
8: Increase Wi by 1 and wait a short period of time for 

others to increase their CW values 
9: Count the number of acknowledged packets dur- 
ing a period and measure the average effective 
throughput S^{W,) = 
10: if S, > SY""^ then 

11: Set S*™"^ = Si, W* = W„ W^""^ = ^ where x* 

is the root of the equation \_f^-{_^^„ = g^fe- 

12: Add Wi into Wi 

13: end if 

14: if W, > W.r''^ then 

15: Set Wi = W* 

16: Send a message SearchStop containing W* 

17: Quit the loop by going to line 25 

18: else 

19: \fi = 1 then 

20: Wait sufficient long time for the other players to 

finish the above operations 

21: Send a message IncreaseW 

22: end if 

23: Go to loop 

24: end if 

25: end if 

26: end loop 

27: repeat 

28: Measure CW values of others during a period of time 

29: if minj Wj < Wi — e then 

30: Set Wi to the largest element in W,; that is smaller 

than Wi 

31: end if 

32: until miiij Wj >Wi-e 
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VIII. NUMERICAL EXPERIMENTS 

In this section, we present a suite of numerical experiments to 
evaluate the proposed adapted TFT strategy by demonstrating 
and validating some of the theoretical results of the joint 
medium access and rate adaptation game studied in the paper, 
especially for the cases that we are not able to investigate 
analytically. Specifically, we focus on several scenarios indica- 
tive of the typical interactions among the players in the game, 
starting with the symmetrical case with homogeneous players, 
continuing with a more sophisticated scenario with two classes 
of homogenous players, and finally considering the generic 
asymmetrical scenario with heterogeneous players randomly 
parameterized. In particular, we investigate the structure of 
equilibrium of these scenarios and compare it to the system 
optimum. 

A. Symmetrical Scenario 

We start by analyzing the symmetrical scenario with homo- 
geneous players. To this end, we simulate a standard 802.11 
WLAN of N homogeneous users operating on 64 QAM with 
a = 1, SNR = 15dB (please refer to ([T3]l). We set a large 
rate range with Rm.in.i = 1Mbps and Rmax,i = 100Mbps. 
Table U and Figure [3] compare the efficient system equilibrium 
found by applying Algorithm 1 to the global optimum. By 
comparing the players' equilibrium strategy and the optimal 
strategy, we observe that players are slightly more aggressive 
by using smaller CW value and lower data rate leading to less 
transmission error However, as suggested by the simulation 
results, the efficiency loss of the system due to players' 
selfishness is very small, which demonstrates that the proposed 
adapted TFT framework can bring about a reasonably efficient 
equilibrium with only a small system utility loss. This result 
is especially meaningful given the result on the performance 
anormaly of multi-rate 802.11 WLAN where users tend to 
operate on low data rate lfT2ll . 

Figure |4] further studies the robustness of Algorithm 1 
searching the efficient system equilibrium. To this end, we focus 
on the case = 12 where the efficient system equilibrium is 
W* ~ 220 and we study the case where the system does not 
operate exactly on the equilibrium: (1) the system operates on 
the CW around W* varying from 100 to 500, this may be 
due to the choice of a large step size in the algorithm; (2) 
there are users departing and /or entering without rerunning 
the algorithm, this is to simulate the case where the algorithm 
is not run too frequently, we thus study the system efficiency 
by varying the number of players. We report from the result of 
Figure |4] that even in the case where the system cannot operate 
on the exact equilibrium, the system can still achieve at least 
85% the optimal utility, which demonstrates the robustness of 
Algorithm 1. This robust and tolerant feature can significantly 
facilitate the implementation of the algorithm. 

B. Scenario with two Classes of Players 

We proceed to study a more sophisticated scenario composed 
of two classes, namely L and H, of homogeneous players whose 
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140 
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430 


655 


905 



TABLE I: System equilibrium vs. global optimum 
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Fig. 3: System equilibrium vs. global optimum 

SNR are SNRl = 10 and SNRh = 20, respectively. The 
other parameters are set to the same values as before. The 
results, shown in Figure |5] and Table [III demonstrate again that 
although slightly more aggressive due to individual selfishness, 
the efficient system equilibrium is very close to the system 
optimum from a social perspective. 
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12 


16 


20 


24 




70 


130 


190 


335 


550 


810 




80 


160 


285 


470 


665 


900 



TABLE II: System equilibrium vs. global optimum 



C. Asymmetrical Scenario 

We finally consider various heterogeneous scenarios with 
asymmetrical players. More specifically, we simulate a WLAN 
of N players, each with a SNR randomly chosen from [5, 25]. 
For each N, we run 100 simulations with random SNR and plot 
the average Price of Anarchy in Figure |6] In the simulation, 
we make the following observations: (1) the convergence to a 
system equilibrium is always achieved at the rate adaptation 
level under a given CW value, which confirms our theoretical 



s 
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Fig. 5: System equilibrium vs. global optimum 

results; (2) the PoA remains small, with the average value 
bounded by 1.12 and not exceeding 1.25 in any simulated case, 
which demonstrates the efficiency of the proposed adapted TFT 
strategy in generic scenarios; (3) the PoA is decreasing in the 
number of players which shows the good scalability of the 
proposed mechanism. 
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IX. Conclusion 

In this paper, we have investigated the joint rate adapta- 
tion and medium control in WLANs from a non-cooperative 
game theoretic perspective. We have developed an adapted 
TFT strategy to orient the network to an efficient equilibrium 
where users can jointly configure their CW size and data rate 
selfishly. We have formulated the interactions among selfish 
users under the adapted TFT framework as a non-cooperative 
joint medium access and rate adaptation game. By analyzing 
the structural properties of the game, we have provided insights 
on the interaction between rate adaptation and 802.11 medium 
access in competitive setting. We have shown that the game 
has multiple equilibria, which, after the equilibrium refinement 
process that we develop, reduce to a unique efficient one. 
We have developed a distributed algorithm to achieve this 
equilibrium and demonstrated that the equilibrium achieves the 
performance very close to the system optimum from a social 
perspective. 
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